Data in Enterprise OMOP
Emory's OMOP Enterprise pipeline transforms clinical data from Epic and the Clinical Data Warehouse (CDW) into the OMOP Common Data Model. This section covers what's in the data, how it got there, and how we know it's right.
-
Data Mapping
How source data flows from Epic and CDW into OMOP — the ELT pipeline, vocabulary mapping coverage, and custom concepts.
-
Data Quality
Automated quality checks across 2,374 DQD tests (96.6% pass rate), 133 DBT tests, and a tracked list of known issues.
-
Observed Conventions
OHDSI community conventions, Emory-specific conventions, and documented adherence to standards across the pipeline.
-
NLP Infrastructure
Proposed span-based NLP schema extending the OMOP CDM — pipeline provenance, typed extractions, and
_DERIVEDtables for clean separation of NLP-derived data. -
Releases
Version history from v0.2.0 through v1.0.0 — what changed, what was fixed, and what researchers should know.
Data Mapping at a Glance
| Area | Pages |
|---|---|
| Pipeline | Extract Load Transform (ELT) · Era Algorithms |
| Coverage | Vocabulary Mapping Coverage |
| Extensions | Custom Concepts · Requesting Mappings · Contributing Vocabularies |